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ABSTRACT 



The present invention concerns a parallel multiprocessor- 
multidisk storage server which offers low delays and high 
throughputs when accessing and processing one- 
dimensio naj and multi- d imensiona l fil e data s uch as pixmap 
images, text, sound or graphics. The invented parallel 
multiprocessor-multidisk storage server may be used as a 
server offering its services to computer, to client stations 
residing on a network or to a parallel host system to which 
it is connected. The parallel storage server comprises (a) a 
server interface processor interfacing the storage system 
with a host computer, with a network or with a parallel 
computing system; (b) an array of dis k nodes, each disk 
node be ing composed bv one processor electri cally con- 
nected to at least one disk an d (c) an interconnection 
network for, cpnnecting the server interface proce ssor with 
the array of disk- pades. Multi-dimensional data files such as 
3-d images (for example tomographic images), respectively 
2-d images (for example scanned aerial photographs) are 
segmented into 3-d, respectively 2-d file extents, extents 
being striped onto different disks. One-dimensional files are 
segmented into 1-d fi le extents. File extents of a given file 
may have a fixed or, a_ variable size . The storage server is 
based on a parallel image and multiple media file storage 
system. This file storage system includes a file server 
process which receives from the high leyej^stprage_ server 
pi9^SS_file_creatign, file opening, file closing and file 
deleting commands. It further includes extent serving pro- 
cesses running on disk node processors, which receive from 
the file server process commands to update directory entries 
and to open existing files and from the storage interface 
server process commands to read data from a file or to write 
data into a file. It also includes operation processes respon- 
sible for applying in parallel geometric transformations and 
image processing operations to data read from the disks and 
a redundancy file creation process responsible for creating 
redundant parity extent files for selected data files. 

27 Claims, 9 Drawing Sheets 
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METHOD AND APPARATUS FOR A 
PARALLEL DATA STORAGE AND 
PROCESSING SERVER 

Matter enclosed in heavy brackets [ ] appears in the 
original patent but forms no part of this reissue specifi- 
cation; matter printed in italics indicates the additions 
made by reissue. 

BACKGROUND OF THE INVENTION 

Professionals in various fields such as medical imaging, 
biology and civil engineering require rapid access to huge 
amounts of pixmap image data files. Today's acquisition 
devices such as video cameras, still image cameras, medical 
image scanners, desktop scanners, graphic arts scanners are 
able to generate huge quantities of pixmap image data. 
However, existing desktop computers and workstations do 
not offer sufficient storage bandwidth and processing capa- 
bilities for fast browsing and zooming in large pixmap 
images stored on disks and for applying geometric transfor- 
mations and image processing operations to large image 
files. Pixmap image data has to be stored, made accessible 
and processed for various purposes, such as fast interactive 
panning through large size images, image zooming For 
displaying large size images in reduced size windows, image 
browsing through sequences of independent images, access 
to video sequences and sound streams, extraction and trans- 
formation of given image parts. 

File data may consist of 2-dimensional images (for 
example aerial photographs), 3-dimensionaI images (for 
example tomographic scans, video sequences, sets of 
2-dimensional images), or one-dimensional data of a spe- 
cific media (for example sound, text, graphics). File data 
further comprises compressed images, compressed sound or 
compressed text. File data also comprises one dimensional, 
2-dimensional and 3-dimensional arrays of elements which 
are of different nature but can be assimilated to arrays of 
pixels. 

Various configurations of prior art computers and disks 
can be used for storing pixmap images and multiple media 
data. Single disk systems are too slow to provide the 
bandwidth necessary for fast browsing through large images 
or for accessing high-quality video image streams. Disk 
arrays such as redundant arrays of inexpensive disks, known 
as RAID systems [ECHEN90], can be used to increase the 
data bandwidth between mass storage and CPU, but a single 
CPU driving a disk array does not offer sufficient processing 
power to apply the image access and processing operations 
required for example for panning with a limited size visu- 
alization window through large images, for displaying 
reduced views of large images in limited size visualization 
windows or for applying transformations to given image 
parts. 

The presendy invented multiprocessor-multidisk storage 
server presents a cheaper and more powerful alternative for 
storage and processing of large files such as 2-d and 3-d 
pixmap image files, video sequences, sound, text and com- 
pressed media data (images, video, sound and text). It may 
be used as a local network server, as an ATM broadband 
ISDN server, as a powerful backend server of a host com- 
puter or as a storage server For a parallel system. 

For the description of the invention, the following termi- 
nology is used. The invented server architecture has been 
created primarily for the storage of image data. Therefore, 
the underlying parallel file system is explained by showing 
how image files are stored and accessed. Nevertheless, the 



38,410 E 

2 

concept is more general, and data files which are not images 
can also be stored, accessed and processed on the invented 
storage server. The meaning of pixels is generalized to 
information elements composed of a given number of bytes. 

5 The meaning of pixmap images is generalized to arrays of 
information elements. Furthermore, the concept of pixmap 
images, which is generally used in the context of 
2-dimensional arrays of pixels, is generalized to the third 
dimension. A 3-dimensional pixmap image is therefore 

no defined as a 3-dimensional array of pixels. Pixels are rep- 
resented by at least one byte. Data files of any kind may be 
segmented into extents, extents being one-dimensional for 
one-dimensional files, 2-dimensional for 2-dimensional files 
and 3-dimensional for 3-dimensional files. Extents are the 

is parts of a file which may be striped onto different disks at file 
storage time. Data files include the data as well as metadata 
associated with the file. For example, an image file includes 
pixmap image data and metadata specifying various char- 
acteristics of the image file, such as its size in each 

20 dimension, the size of its extents in each dimension and its 
colour palette. The metallic of a compressed image file may 
also contain a table giving the effective size in bytes of each 
compressed extent. 

Accessing rectangular windows from large image files is 

25 a frequent operation, image windows are defined as rectan- 
gular windows containing an integer number of pixels in 
each dimension. Image window boundaries may be located 
at any pixel boundary. W hen an image file is segmented int o 
extents T ali^n ^d imaffe windows are defined as the subset o f 

30 wi ndows whose boundaries coincide with extent boun d- 
aries;. 

File storage and access operations are used as a general 
term for accessing file data. Such accesses comprise data 
access to image file windows useful for panning purposes 
35 and subsampling operations useful for producing scaled- 
down rectangular image windows displayable in reduced 
size visualization windows. Both image file window data 
extraction and subsampling operations require proc essing 
p ower, given in the present apparatus by the parallel pro- 

40 c essjnjL nOWPr nf riiskui Qde_JMPi£ ^rc which are rE^THhftH 

in more detail belo w. 

Prior art methods of storing and accessing large sets of 
pixmap image files are based on high-performance worK- 

45 st ations accessi ng^arraysjxLdisks. They do no t offer the 
mea ns to CQntroJUhe^dis trito the 
disks. Furthermore, the workstation's CPU does not offer 
suffi cient process ing power to scale down large imagejiles 
^ high-sp^i i" "rrW t p display them in limited si ze 

5Q vi sualization windows or to ap pl y to thenLgeometric t rans- 
formations such as rotations. The presently invented data 
storage apparatus is based on disk nodes, each disk node 
being compo sed by one processor electrically connected to 
at least one dis k. An array built of such closely coupled 

55 processor-disk nodes offers both high disk throughput and 
highly usable parallel processing power. The invented par- 
allel file storage and access method described below pro- 
vides efficient distribution of files onto disks and high-speed 
access to requested file windows. 

60 SUMMARY OF THE INVENTION 

The present invention concerns a parallel multiprocess or- 
miiltidjsk-slo rage server which offers low delays and h igh 
throughputs when accessing one-dimensional and multi- 
65 dimensional file data such as p ixmap images, text, sound o r 
graphics. Multi-dimensional data files such as 3-d images 
(for example tomographic images), respectively 2-d images 
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(for example scanned aerial photographs) are segmented 
iqtn ^-dj respec tively 2-d^file extents, each extent possi bly 
bei ng stored on a different dist_ . One-dimensional files (for 
example sound or text) are segmented into one-dimensional 
extents. 

The invented parallel multiprocessoT-multidisk storage 
server may be used as ajrerver offering sftrvjcg* to a 
c omputer tojwhic fr it i&^nnected-J x_.client statio ns residing 
o n a network to jwMc hJftas^CQn qected, or to a parallel host 
system to which it is connected. 

The parallel storage server comprises 

(a) a server interface processor interfacing the storage 
system with a host computer, with a network or with a 
parallel computing system; 

(b) an array of disk nodes, each disk node being composed 
by one processor electrically connected to at least one 
disk; 

(c) an interconnection network for connecting the server 
interface processor to. the array of disk nodes. 

The parallel storage server runs a server interface process 
expecting serving requests from client processes, a file 
server process, and extent server processes responsible for 
data storage and access as well as additional processes 
responsible for geometric transformations and image pro- 
cessing operations, for creating r edundancy file s and for 
recovering Jiles in cases i of single. disk crash es. 

T^ne storage server is based on a parallel multi- 
dimensional file storage system. This file storage system 
incorporates a file server process which receives from the 
storage server interface process file creation, file opening, 
file closing and file deleting commands. It also incorporates 
extent serving processes which receive from the file server 
process commands to update directory entries and to open 
existing files and receive from the server interface process 
commands to read data from a file or to write data into a file. 
It further incorporates operation processes responsible for 
applying in parallel geometric transformations and image 
processing operations to data read from the disks. It also 
incorporates re^uridancy_file_creation processes responsible 
for creating realu_gc_arit4_arity w extent files for selected data 
files. 

When acting as a host backend server, as a ne tworkserver 
or as a parallel comp uter storage server, the server interface 
process ranging on the server interfac eprocessor receives 
da ta access and processing requests, interprets them and 
decomposes^theri}_into_file_l ey.el requests (for example 
creation, opening, reading, writing and deleting) or into file 
operation requests (for example geometric transformations, 
imagejjroces sinq operations) . In the case of read requests, 
the file system interface library decomposes these requests 
into ex tent read requests and transmits them to the extent 
server processes. It waits for the required extent data fr om 
th e disk nodes, asse mblesi Linto the -req uiredjj ata_window 
an d transmits it to the server interface process which for- 
wards the data to the client process located on the client 
computer. 

When attac hed to a parallel host system, the storage server 
made of intelligent disk nod es interacU- direcUvL-adth-host 
processes. running-on_the_paralleLsystem. _PI ost-processes 
ma y simultaneously access either different files.Qr_the.same 
file thr ough the file system interface libraries and a sk for 
arbitrarily sized da ta wind pws. 

"ExTe^tserving processes running on disk node processors 
are responsible. For* managing the free blocks of their 
connected storage devices, for maintaining the data structure 
associated with the fite~extents-stored _on their d isks, for 



reading extents from disks, For writing extents to disks, and 
for maintaining a local extent cache offering fast access to 
recently used data. At imageji ccess time, extent se rver 
processes arejesponsible for , accessing_ext ejts covered Jb y 
5 the req uired visualization winda w^e xtracting the visuali za- 
tioirwir^^ 

t he storage server interface, processp^ Jip the case of a 
zooming operation, the extent server processes subsample 
the original data in order to produce an image aUhe req uired 

10 r ejjucedUiskr Pisk node processors may run slave operation 
processes used for applying geometric transformations such 
as rotati ons or other ima ge_ processing operations to lo cally 
re trieved jrnage_4iajls. 

The invented parallel file storage server distinguishes 

15 itself From the prior art by the Following features: 

(1) its file storage system runs on a multiprocessor- 
multidisk platform and offers 2-d and 3-d image storage 
and access services requiring simultaneous disk 
accesses and processing operations such as high-speed 
panning and zooming in sets of large images; 

(2) it comprises information about file sizes in each 
dimension, about how files are segmented into 
Jj*_imejisjonal, 2-dj mensional or 3- dimensio nal extents 
and about how_e xtentsare distributed on a sufjset of t he 
a vailable disk s; 

(3) it provides library procedures for parallel application 
of geometric transformation and processing operations 
to data striped on multiple disks; 

(4) disk node processors combine parallel file extent 
accesses from disks and application of the required 
geometric transformation and processing operations. 

The invented parallel file storage server if Further char- 
acterized by the Following Features: 

(1) it offers services for creating multidimensional (1-d, 
2-d, 3-d) image and multiple media Files in addition to 
conventional files; 

(2) it comprises a f ile server process r esp onsible fo r.global 
op erations such, as file creation, file openi ng and file 
deleting- 

(3) it comprises extent server processes responsible for 
accessing extents^stored ontheir loca l disks, for man- 
aging a local extent cache and for interacting with the 
file server to create or delete directory entries; 

(4) it comprises operation processes running on disk node 
processing units capable of applying geometric trans- 
formations or image processing operations in parallel to 
image parts read from the disks, thereby speeding up 
processing time by a significant Factor; 

(5) it comprises redundancy file creation processes 
capable of generating redundancy Files From given 
data files; 

(6) it comprises a recovery process capable of recovering 
files in cases of single disk crashes. 

The invented parallel image and multiple media server 
offers the following advantages over the prior art: 

(1) disk node processing units are located close to the 
disks enabling disk file accesses and processing opera- 
tions to be closely combined (for example pipelined); 

(2) the number of disk node processing units and of disks 
attached to each disk node processing unit can be 
independently chosen and adapted to the requirements 
of an application; 

(3) due to the fact that the parallel server knows about the 
dimensionality of an image file and about its access 
patterns (i.e. for example access to rectangular win- 



20 



25 



30 



35 



40 



45 



50 



60 



65 



04/20/2004, EAST version: 1.4.1 



US RE38,410 E 

5 6 

dows in large image files) and that the parallel file necting the server interface processor to the array of intel- 

system supports multi -dimensional files segmented into ligent disk nodes. 

multi dimensional extent s, it is able to segment the file jh e server interface processor 12, the interconnection 

into extents in such a way that file parts are accessed on network and the disk node processors 13 are made of 

a multiprocessor-multid isk system Jn a moTTehici ent 5 commercially available components, such as iSSO or T9000 

wav thaaAwi lh a conventional file system using a RAID transputers, communication links and possibly a communi- 

disk array as its storage device; cat i 0 n crossbar switch like the Inmos C104 transputer link 

(4) Since extents of compressed image files are indepen- crossbar switch. The interface processor is either connected 

dently compressed and since the parallel file system to a host station 11, preferably through a SCSI-2 target 

supports the storage of extents having a variable size 30 interface 17 or directly connected to a network 19 with a 

within the same file the advantages mentioned in point network interface such as Ethernet. FDDI or ATM broad - 

(3) apply in the case of compressed image files. band ISDN. 

The server interface prnce^or^njfc^ [he client co mputer, 
BRIEF DESCRIPTION OF THE DRAWINGS or remotely located c^ r nt j|« f^rvinfts whirh. rnm- 
FIG. 1 is a block diagram illustrating an example of a 15 prisejile storage^le^access, file transfoiniaiiaB^nxo- 
parallel image and multiple media server. cessing. an^flle recovery operations. File storage and access 
- , . - . . 4 c services are offered by the underlying parallel file system. 
FIG. 2 shows an example of services running on interface ^ m fik m ^ a me ^Vr- proofs 23 
processor 28 and on disk node processors 29. qq my Qne Qf ^ proccssore >r cxamplc ; hc 
FTG. 3 shows an example of a server interface process 20 mterface p^cSsor^M, and extent server processes 24 run- 
interacting with a client process and with the file system's ning on the disk node processors 29 (FIG. 2). As'snown in 
file and extent server processes. FIG. 3, the file server process Receives from the server 
FIG. 4 gives a preferred embodiment of the disk layout of interface proces s. 31 interface lirx arv_32^ fiIe, creation f fi le 
the parallel file system's data structures. opening and file closing commands, and interacts with 
FIGS. 5a, 5b and 5c give examples showing how 25 extent server processes 34 to create new file entries in the 
1 -dimensional, 2-dimensional and 3-dimensional files are D/rectory and in the Distribution index Block Table 
segmented into extents. described below. Extent server processes run on the disk 
FIG. 6 gives examples of how extent sizes influence the node processors 29 and receive commands either from the 
distribution of extents to disks: a well chosen extent size is 30 file P rocess 23 or f rom *° server interface process 21. 
shown in FIG. 6a which provides quite uniform extent Disk node processors 29 which run extent-server processes 
access patterns within a given visualization window and a 24 and slave operation processes 27 are used for parallel file 
badly chosen extent size is shown in FIG. 6b which induces stora S e and access Unctions, for extracting in parallel visu- 
a non-balanced extent access pattern. alization windows from extents, and for applymg in parallel 
„ , . . ,. ... geometric transformation sor image pro cessing ope rations to 

FIG. 7 shows an example of a 3-dimensional window 70 35 • , ♦ A c *u a- 1 ^ 

, . „ . r . . image data read from the disks. 

located in a 3-dimensional image file. ^ - * ^ ■■ » ■ 

™™ « j 01 ■ r ^ . . Disks are members of a pool. As shown below, the pool 

FIGS. 8a and 8b give example of a 2-dm.ensional unage membershi of a ^ can ^ identified by analyzing its 

file segmented into extents striped onto 5 disks, covered by ^ ^ ^ ^ of ^ ^ 
a 2-dimensional file window 81. 

FIG. 8b shows that extent windows 82 have to be 40 Preferred layout of file structures on disks 
accessed from the disks in order to obtain all extent parts 

covered by the 2-dimensional window ^ P^^ 1 me svstem uses the blowing preferred data 

Jrt . ir^i structures for the layout of information on the disks and for 

FIGS. 9a, 9b and 9c show an example of a file providing easy access t0 files striped on multiple 

transformation, where a rectangular window covered by 45 ^ (p f G ^ , n £ ^ of each ^ , disk ^ 

several extents » to be rotated by a specified rotauon angle. ^ ft ^ ^ 

DETAILED DESCRIPTION OF THE identifier, the size of the disk, its type, the name of the disk 

PREFERRED EMBODIMENTS P°°^ * l ^longs t0 > me disk's logical number within the pool 

and pointers 42 to the Block Allocation Table (BAT), to the 

Current apparatuses for storing, accessing and processing 50 Directory (DIR) and to the Distribution Index Block Table 

of image data are based on workstations connected to RAID (DIBTable). The second part of each disk contains the Block 

disk arrays [Chen90]. RAID disk arrays connected to single Allocation Table (BAT) 43 made of a bitmap, where each 

processor workstations do offer higher storage capabilities block of the disk is represented by one bit, this bit being 0, 

as well as a higher throughput than single disks. However respectively 1 depending on whether the corresponding 

RAID disk arrays to not incorporate processing power for 55 block is free, respectively allocated. The third part of each 

performing image scaling operations, image window extrac- disk contains the Directory (DIR) 44. Each entry in the 

tion operations, image transformations or image processing directory contains one file name and a pointer to the corre- 

operations. Such operations must be executed by the host sponding Distribution Information Block (DIB) in the Dis- 

processor. tribution Information Block Table. The fourth part of each 

The invented parallel storage server apparatus (FIG. 1) 60 disk contains the Distribution Information Block Table 

presents a cheaper alternative to image and multiple media (DIBTable) 45, Each table entry is associated with one 

storage devices based on workstation and RAID disk arrays. Directory entry. An entry in the Distribution Information 

It includes a server interface processor 12 interfacing the Block Table contains information which is generally found 

storage system with a client computer 11 (host computer) or in index blocks of file systems and file image related 

with a network 19, an array of intelligent disk nodes 18, each 65 information such as the image size in each dimension, the 

disk node being composed of one processor 13 and at least extent size in each dimension, the number of disks used by 

one disk 14, and an interconnection network 16 for con- the specified file, a disk table specifying the disk identifiers 
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of the used disks and, for each used disk, a pointer 46 to a the number of disks (FIG. 6A and 6B). Expressed 

File Local Extent Index Table (FLEIB). The DIB also mathematically, the greatest common denominator of the 

contains a field for describing and pointing 47 to optional file absolute value of this difference and the number of disks 

metadata 48 (color palette, size of compressed file-extents). should be one: 

This field specifies the disk where the metadata is located, 5 

the meUdata^sjzejn^ GCDdDiskNumbcr(ExtcntNumbcrEIx,yI>-DiskNumber(Extciit- 

metadata_is_stored. Metadata is stored in continuous disk Numbcilx^+il^NumberOfDisks)-.! (3) 

block locations. ^ e difference between an extent number and the extent 

Space for the Block Allocation Table (BAT), the Directory number Qne row bdow it caQ ^ varied b changing the 
(DIR), and the Distribution Index Block Table (DIBTable) 

extent size in the horizontal direction (extent width). Reduc- 

allocated at pool initialization time. The Disklnfo BAT ,DIR iQg Respectively enlarging the extent width increases, 

and DIB disk spaces are replicated on each of the disks of res ^ ctively re duces the difference between extent numbers 

the pool. This enables information about the pool to be 0 f extents lying one below the other. As an example, FIG. 6a 

recovered from any disk. At pool initialization, the file incor p 0rates a fi i e ^th 11 extents per'extent row and FIG. 

server process initializes its list of directory entries. For each is fib ^ m& with 12 exte nte.per extent row. .According to 

file there is File Local Extent Index Table (FLEIB) on each eouatic^), reduction or edargemenLpf ^extent^width 

disk on which the file is striped. The FLEIB table includes ma kes,iUpossibl e to cha nfie^h^offset^bj Sveen t he disk 

for each locally stored extent a pointer to its starting disk n umbc r of extentsly inl^e belowJhe_other. FIG. 6a shows 

block and the effective extent size in bytes. Extents are ^^[^^^^S^^:SS^^^^o&cX 

stored in continuous disk block locations. 20 is 2fpr 3 94isk^. FIG. 6b showlan exam plTof a bad jy cEgsen 

At file creation time, a directory and a DIB entry and, on ^tent size where the diskoffset is 3 for_9_disks. thus not 

each contributing disk, a file local extent index table fu ifUlinglhe condition .me ntioaed, in ..frymul _a j _(3) . 

(FLEIB) are created. At file writing time, disk blocks are Experiments and simulations have shownjhat for disks 

allocated for each extent to be written on that disk. The havmg^aJhroughpm^oXjLtoJLM extent 

pointer to the first disk block of each extent on the disk is 2 size^shouldTe^oserTsqls^ contain an amount of data 

placed at the corresponding location of the FLEIB table. between 50 Kbytes al^^ the size 

in bytes of an extent is obtained b y^multiplv jng^epixel size 

Segmentation of image files and disk striping m bytes by th e extent dimensionVin all p'res^nTdin^nsions. 

m£ ^ 0 ^ 3Q For example, the size of a 3-dimensional extent is computed 

FIGS. 5a, 5b and 5c show examples of one-dimensional, as follows: 
2-dimensional and 3-dimensional image files segmented 

into one-dimensional, 2-dimensional and 3-dimensional ExtcatSi2einBytes=ExtcDtSizeinX*Extcntsi2clnY*ExtcntSizeinZ* 

extents. Extents are numbered first in the x direction, then in NumberOfflytesPerPixel. 

the y direction and then in the z direction. For example, a 35 ^ fik prQcess enables the miliali2ation a , disk 

3 : ojmens i onaJ i ^ x j made 0 f a freely chosen number ^ djs3T5o~deTand of 

direction, the ithTxtent in y direction, and the kth extent m a^b^fdisks per disk nodeTa^nJbelween 1 and 7 or 

z direction gets the following ExtentNumber: between 1 and 15 depending on the type of SCSI bus 

interface used. It enables the creation of one-dimensional, 

ExtentNumbcrfij^-knNumbcrOfExtcntsinY^NbOfExtcntsi^ 40 2.dimensional or 3-dimensional files by specifying their 

j'NbOffixtentsinX+i (i) final size in each dimension, their pixe^size in bytes, their 

extent size in each dimension, the fixed size or variable 

where NbOfExtentsInX and NumberOfExtentsIlnY are the extent size mode in which the file is to be created, the size 

number of extents required to cover respectively the width 0 f tne space to be reserved for metadata, the number of disks 

(x-axis) and the height (y-axis) of one plane of the consid- 45 and the disk table specifying theudentities of.me.CQntribut- 

ered 3-d pixmap image. FIGS. 5a, 5b and 5c give examples djstejnjhe pool. In order to obtain good performances, 

of how extents are numbered. thedisks on which a given file is stri ped shoulcLpreferably 

Extents are striped oqtQ a fiufrsf.t nf inn disks, available be located-in^i ferent disk n odes, 

with in, lhe-_dislejool (contributing disks). The number of „ ~~~~ T~ „ 

c^bufing disks (NumberOfDisks parameter) as well as 50 ParaUel file s y stem mterfiloe llbrary P rocedures 

the disk identifiers on which the file is stored are specified The server interface process 21 accesses the functions 

at file creation time. offered by the parallel file system through library procedures 

Tne extents are striped according to their extent number of the interface library 22. There^is a library procedure for 

on the contributing disks, as specified by the following creating aJ51e of a g iven size ineach dim ejisiQru-witli.a. given 

formula: 55 e xtent size _in each dimension. This file creation library 

procedure transmits the file creation command to the file 
server process, which finds a free entry in its directory entry 

DiskNumbct<ExtcntNumber)-ExtcntNumber modulo Number- Ustj and f orwar d s the file creation req uesLt o extent se rver 

u processes, specifying tKe~3irectory entry number of the new 

At fil e_creatioD time, any extent size having an inte ger 60 file, 

numhexof pixels in each dim ension can be chosen . Extents A library procedure exists for reading a 1-dimensional, 

may cover a la rger space than t he space containing the ima ge 2-dimensional or 3-dimensional pixmap image window 

file jnformation (see FIG75ATSB and 5C). In order to obtain from an existing file. The pixmap image window read from 

a balanced distribution of extents within a gi ven image the file covers a freely specified window of the full file 

visuahzajipn .window, the DiskNumber of one extent(6l) and 65 space. In order to read a window from an image file striped 

the'diskNumber of the extent located one extent row below on several disks, the reading procedure performs the fol- 

62 should have a difference (disk offset) which is prime to lowing steps: 
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(1) it computes the number of identifying every extent sponding FLEIB table entries. Each FLEIB entry also con- 
contributing to the required image file window; tains a field specifying the effective compressed extent size 

(2) for every contributing extent, it computes the part in bytes. A library procedure exists for writing a compressed 
covered by the required image file window 81 (extent image file to a file previously created in variable extent 
window); 5 mode. The parameters to be transferred by the server inter- 

(3) it computes the disk number of each contributing face P 10 ** 5 * are a table specifying for each compressed 
extent according to formula (2); extent its size m bvtes ™ d a P ointer t0 its memory location. 

(4) from the disk table stored in the DIB, it computes the ^ library procedure sends complete extent writing 
disk identifier of each disk number; requests o the extent server processes Extent server pro- 

^ „ , „ , . , 10 cesses allocate the disk blocks required for writing their 

(5) for each requested extent, it sends an extent window wfite ^ ex(en|s 0Q ^ b , ocks and date |he 

read request to the extent server running on the disk correspoQding FLEIB table entries. A similar library proce- 

node connected to the disk storing the extent; dufe e)dsts for reading compressed image flles from , he 

(6) it receives the required extent windows from the fafe. If an aligned window is requested by the reading 
extent servers; 15 procedure, only the complete extents covered by the aligned 

(7) it assembles all extent windows into the required window are requested from extent server processes, 
image file window and returns it to the calling program. ... . „, , . , 

A library procedure for writing an image file window into Applying operations to file data striped onto disks 

a previously created image file exists and works similarly to One of the advantages of the invented multiprocessor- 

the image file window reading procedure, with the difference 20 multidisk server is its inherent ability to perform in parallel 

that extent windows are sent to the extent servers, that extent operations on data read from the disks. The server comprises 

servers must access their disks to read the complete corre- an operation master process 26 located in the server inter- 

sponding extents, write into them the extent windows and face processor 28 and operation slave processes 27 located 

write back the complete extents to the disks. on disk node processing units 29. The server interface 

A library procedure exists for accessing and scaling-down 25 process may require an operation to be applied to a given 

an image file window into a smaller window (zooming). The file. It transmits to the operation master process the name of 

procedure works in a similar way as the image file window the file, parameters specifying the type of the operation to be 

reading procedure, the only difference being that the con- performed, the data domain (for example, the image file 

tributing extent servers are asked to subsample their extent window) on which it is to be performed and the form in 

window obtained from the disks and to send the subsampled 30 which the results are to be given back, 

extent windows to the library procedure which assembles all As an example of an operation, the rotation of oblique 

subsampled extent windows into the required scaled-down rectangular windows lying in 2-dimensional pixmap images 

image window. is described. The server interface process specifies to the 

Library procedures exist for reading and writing aligned underlying master operation process an oblique rectangular 

image windows. Since aligned image windows are aligned 35 window within a given image file. The rotating task consists 

with extent boudaries, the procedures interact with the of rotating the image part lying within the specified rectan- 

extent server processes in order to read or to write complete gular window by a certain rotation angle so that its sides 

extents. In the case of a file opened in variable extent size become aligned with the horizontal and vertical axes. The 

mode, requests for writing complete extents include param- master process opens the corresponding file and gets its DIB. 

eters specifying the extent sizes. Similarly a request for 40 With the information in the DIB, it computes which are the 

reading complete extents also returns the effective size of file extents covered by the rectangular window. It also 

each extent. computes the continuous boundary of the oblique rectangu- 

Library procedures exist for writing and reading metadata lar window within each extent. File reading and operation 

from files. There are also library procedures for closing a commands are transmitted to the slave operation processes 

file, for deleting a file and for listing the files present in the 45 residing on the disk node processors. Each slave operation 

directory (DIR). process is asked to read those extents contributing to the 

Further library procedures exist for writing and reading oblique rectangular window, which are stored on the disks 
compressed files, for applying operations on file data, for controlled by its disk node processor. Slave operation pro- 
creating redundancy files and for the recovery of files from cesses must also read additional pixel lines or columns from 
a single disk crash. The functions executed by these library 50 neighbouring extents in order to be able to perform resam- 
procedures are described in the description which follows. phng and interpolation operations at their extents' boundary. 

r „. The operation they are asked to apply to each concerned 

Storage of compressed image files eJ£tent consisls of rotating by the required roUtion angle the 

The invented storage server also stores compressed image continuous boundary obtained by intersecting the extent 

files. The content of each image file extent is separately 55 boundary 91 with the oblique window boundary 92. This 

compressed, preferably at the client location. A compressed rotated boundary contains the contribution of the current 

image file contains metadata which specifies the compressed extent to the whole rotated window 93. A target pixmap 94 

size in bytes of each extent. The file system supports the circumscribing said rotated boundary is defined by its width 

storage of extents having a variable size. When a file is 96, height 97 and by the relative position 98 of said rotated 

created in variable size extent mode, the DIB will contain 60 boundary. This target pixmap is scanned pixel by pixel, 

information relative to the corresponding uncompressed file scanline by scanline by the slave operation process. Pixels 

(image size, extent size, number of disks, disk table). In the whose pixel centers are lying outside the rotated boundary 

same way as for uncompressed image files, FLEIB tables are are set to 0 (black). The position in the non-rotated original 

generated at file creation time. In variable size extent mode, pixmap file extent corresponding to a pixel center lying in 

one set of continuous disk blocks is allocated dynamically 65 the target pixmap inside said boundary is found by inverse 

when a compressed extent is to be written to a disk. The rotation or by an equivalent incremental operation 

block address of the written extents is stored in the corre- [Hersch85]. In order to compute its intensity, bi-linear 
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interpolation is applied to the intensities of its 4 direct 
neighbour pixels in the non-rotated pixmap file extent. Once 
the full target pixmap has been scanned and one value 
associated with each pixel, it is sent to the master operation 
process together with the parameters specifying the vertices 5 
of said rotated boundary and specifying the location of a 
specific bitmap position (for example the upper-left pixel 
center) in the global image coordinate system. The master 
operation process receives the resulting target pixmaps from 
all contributing slave operation processes. It assembles the 10 
resulting rotated window subparts by superposing the target 
pixmaps associated with each of the contributing extents. It 
transmits the resulting rotated window to the requesting 
server interface process. Since all disk-node processors 
which have extents covered by the specified oblique window 15 
contribute in parallel to the window rotation process, the 
result is obtained in a much shorter time than if this 
operation had been done on a workstation hooked to a RAID 
disk array. 

20 

Redundancy 

Existing RAID redundancy methods [Chen90]require 
generating redundant disk blocks for every file writing 
operation. Writing operations are slowed down and disk 
space is required for storing the redundancy information, 25 
even if the data to be made redundant is already backed up 
in archive memory. 

The invented storage server comprises a cheaper alterna- 
tive and a more flexible redundancy scheme. At first, data 3Q 
files are stored without any redundancy. On request, a 
background redundancy file creation process is run to create 
for each specified data file one redundancy file. Each redun- 
dancy file is stored on different disks than its corresponding 
data files. Thanks to the parity extents generation method 35 
described below, data files for which redundancy files have 
been generated can be recovered in the case of single disk 
failures. This redundancy scheme works due to the fact that 
a data file can be striped onto a specific subset of the disks 
of the pool, allowing its redundancy file to be stored on a 
complementary disk subset of the pool. 

The redundancy file contains one parity extent for each set 
of subsequent extents of the original file striped into different 
disks. Let us assume that a data file is composed of k extents 
striped over d disks. The first parity extent of the redundancy 45 
file is obtained by computing for each of its bytes the parity 
of the corresponding bytes from the first d data file extents 
0 to d-1. The so-generated parity extent is the first extent of 
the redundancy file. The jth parity extent is obtained from 
data file extents j*d to (j*d)+d-l. The redundancy file 50 
contains k+d-1 DIV d parity extents stored on a set of disks 
on which no extent of the original data file has been stored. 

The invented redundancy scheme enables data files to be 
recovered in the case of single disk crashes. It has the 
advantage of being optional, thus saving disk space by 55 
avoiding the creation of redundancy files in the case of data 
files already saved on archive devices. Furthermore, since 
the redundancy files are separated from data files, they may 
be generated by a background redundancy file creation 
process 20 active only during the idle time of the storage $0 
server. 

File recovery requests in the case of single disk crashes 
are executed by a file recovery process 25 locating the failed 
disk, opening the data file and its corresponding redundancy 
file, reading for each set of data extents stored on different 65 
disks those extents stored on valid disks as well as the 
corresponding parity extent, computing from corresponding 
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data extents and parity extents the extents located on the 
failed disk and transmitting the recovered file to the request- 
ing process or storing it on a set valid disks. 

While the invention has been described with reference to 
a preferred embodiment thereof, it will be understood by 
those skilled in the art that various changes in form and 
detail may be made without departing from the scope and 
spirit of the invention. 

What is claimed is: 

1. A storage and processing server computing system 
apparatus for offering processing services including[:] par- 
allel file storage, access, and processing functions, compris- 
ing: 

(a) a server interface connected by a communication 
means to at least one computer; 

(b) disk nodes[.], said disk nodes each comprising a disk 
node processor and at least one disk: 

(c) an interconnection network between said server inter- 
face and disk nodes whereby: said server interface is 
operable for receiving service requests from computers, 
further operable for analyzing, processing and decom- 
posing service requests into storage, access and pro- 
cessing commands and further operable for transmit- 
ting them to said disk node processors which are 
operable for receiving commands, further operable for 
executing commands and further operable for returning 
results, whereby an operation is selected from the 
group of geometric transformations and image process- 
ing operations. 

[2. The storage and processing server apparatus defined in 
claim 1, wherein service requests from remotely located 
client computers comprise 

(a) 1 -dimensional, 2-dimensional and 3-dimensional file 
storage and access requests; 

(b) file transformation and processing requests; 

(c) redundancy file creation requests; 

(d) file recovery requests; wherein 

(i) file storage and access requests comprise file 
creating, opening, and deleting commands and file 
extent storage and access commands executed in 
parallel by disk node processors; 

(ii) file transformation and processing requests are 
decomposed into file part transformation and pro- 
cessing operations executed in parallel by disk node 
processors; 

(in) redundancy file creation requests are executed by 
generating from data file extents parity file extents 
which are stored on different disks from the disks 
storing the corresponding data file extents; 

(iv) file recovery requests in the case of single disk 
crashes are executed by reading valid data extents 
from the file to be recovered and redundant parity 
extents from its corresponding redundancy file, by 
computing the lacking data extents and by writing 
the recovered file on valid disks.] 
3. The storage and processing server apparatus defined in 
claim 2, wherein file storage and access requests comprise 

(a) file window storage and access commands, decom- 
posed into extent window storage and access com- 
mands executed in parallel by disk node processors; 

(b) scaled-down file window storage and access 
commands, decomposed into scaled-down extent win- 
dow storage and access commands executed in parallel 
by disk node processors; 

(c) file aligned window access commands, decomposed 
into complete extent storage and access commands 
executed in parallel by disk node processors. 
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4. The storage and processing server apparatus defined in 
claim 3, where aligned window access commands are 
decomposed into variable size extent storage and access 
commands executed in parallel by disk node processors. 

5. The storage and processing server apparatus defined in 5 
claim 2, wherein file transformation and processing opera- 
tions comprise rotation operations executed in parallel by 
disk node processors. 

6. A parallel file storage, access and processing method 
comprising the steps [of] of: 10 

(a) receiving storage, access and processing requests from 
a client process; 

(b) decomposing file storage, access and processing 
requests into a plurality of access, storage and process- 
ing requests executable by disk node [processors:] 15 
processors; 

(c) sending said plurality of access, storage and process- 
ing requests to disk node [processors:] processors; 

(d) receiving in parallel said plurality of access, storage 2 o 
and processing requests by disk node processors; 

(e) executing in parallel said plurality of requests includ- 
ing an operation selected from the group of geometric 
transformations and image processing operations by 
accessing disks block on a plurality of disks and by 25 
applying processing operations executed by a plurality 

of processors; 

(f) assembling the results obtained by the parallel execu- 
tion of said requests into one [result:] result; 

(g) returning said result to the requesting client process. 30 
[7. The parallel file storage, access and processing method 

defined in claim 6, wherein files are multidimensional files 
made of elementary elements, each element being composed 
of at least one byte, wherein multidimensional files are 
segmented into multidimensional extents, wherein extents 
are incrementally numbered as is shown in the case of the 
following 3-dimensional extent positioned as the ith extent 
in the x direction, the jth extent in y direction, and the kth 
extent in z direction according to the formula ^ 

ExtentNumber(ij > k)-k*(NbOfExtcntsrnY*NbOfExtcntsInX)+ 
j"NbOfExtentsInX+i, 

wherein extents are striped into a set of available d disks 
according to the following formula 45 
DiskNumber(ExtentNumber)=ExtentNumber MODULO 
d, 

where NbOfExtentsInX and NbOfExtentsInY are the num- 
ber of extents required to cover respectively the width 5Q 
(x-axis) and the height (y-axis) of one plane of the consid- 
ered 3-d pixmap image.] 

8. A multiprocessor-multidisk storage server offering par- 
allel file storage, access, and processing functions compris- 

(a) a server interface comprising at least one local pro- 
cessor connected by a communication means to at least 
one computer; 

(b) disk nodes, said disk nodes each comprising a disk 
node processor and at least one disk, said disk being &q 
operable for storing data; 

(c) an interconnection network between said server inter- 
face and disk nodes, whereby: said server interface is 
operable for receiving service requests from computers, 
further operable for analyzing, processing and decom- 65 
posing service requests into storage, access and pro- 
cessing commands and further operable for transmit- 
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ting them to said disk node processors which are 
operable for receiving commands, further operable for 
executing commands including an operation selected 
from the group of geometric transformations and image 
processing operations and further operable for return- 
ing results. 

9. A multiprocessor multidisk storage server of claim 8, 
where the interconnection network between the server inter- 
face and the disk nodes comprises a crossbar switch. 

10. A multiprocessor multidisk storage server of claim 8, 
where the number of disk nodes and the number of disks per 
disk node [can be] are independendy chosen according to 
[the requirements of the] client application requirements. 

11. A multiprocessor multidisk storage server of claim 8, 
where aligned window access commands are decomposed 
into variable size extent storage and access commands 
executed in parallel by disk node processors. 

12. A multiprocessor multidisk storage server of claim 8, 
where file transformation and processing operations com- 
prise rotation operations executed in parallel by disk node 
processors. 

13. A multiprocessor multidisk storage server of claim 8 
where file storage and access requests comprise: 

(a) file window storage and access commands, decom- 
posed into extent window storage and access com- 
mands executed in parallel by disk node processors; 

(b) scaled-down file window storage and access 
commands, decomposed into scaled-down extent win- 
dow storage and access commands executed in parallel 
by disk node processors; 

(c) file aligned window access commands, decomposed 
into complete extent storage and access commands 
executed in parallel by disk node processors. 

14. A parallel file storage, access and processing method 
using processes running on a server interface processor and 
processes running on disk node processing units, the method 
comprising the steps: 

(a) receiving by the processes running on the server 
interface processor storage, access and processing 
requests from a client process; 

(b) decomposing file storage, access and processing 
requests into a plurality of access, storage and process- 
ing requests executable by processes running on disk 
node [processors:] processors; 

(c) sending said plurality of access, storage and process- 
ing requests to processes running on disk node [pro- 
cessors:] processors; 

(d) receiving in parallel said plurality of access, storage 
and processing requests by processes running on disk 
node [processors:] processors; 

(e) executing in parallel said plurality of requests includ- 
ing an operation selected from the group of geometric 
transformations and image processing operations by 
accessing disk data on a plurality of disks and by 
applying processing operations executed by processes 
running on a plurality of [processors:] processors; 

(f) assembling the results obtained by the parallel execu- 
tion of said requests into one result; and 

(g) returning said result to the requesting client process. 
[15. A parallel file storage, access and processing method 

of claim 14, where processes running in a server interface 
processor comprise a server interface process and at least 
one file server process and where processes running on disk 
node processing units comprise extent server processes, said 
server interface process being responsible for receiving file 



04/20/2004, EAST Version: 1.4.1 



US RE38 

15 

storage, access and processing requests and for transmitting 
file creation, opening and closing requests to the file server 
process and for transmitting file access requests to the extent 
server processes, said file server process being responsible 
for interacting with the extent server processes for creating 5 
new file entries in a file directory and in a distribution index 
block table, said distribution index table comprising file 
dimensionality information, file size information and file 
extent size information and comprising, for each disk on 
which a file is stored, a pointer to its file local extent table, 10 
said file local extent table comprising for each locally stored 
extent of a file the address of its starting block and its 
effective extent size.] 

16. A parallel file storage, access and processing method 

of claim 14, where files are multidimensional files made of 15 
elementary elements, each element being composed of at 
least one byte, where multidimensional files are segmented 
into multidimensional extents, where extents are incremen- 
tally numbered as is shown in the case of the following 
3 -dimensional extent positioned as the I th extent in the x 20 
direction, [thej^] the f h extent in y direction, and the 
extent in z direction according to the formula: 

ExtentNumber(ij ) k)»k*(NbOfExtcntsInY*NbOfExtentsInX)+ . 
j*NbO£ExtentslnX+i, 

wherein extents are striped into a set of available d disks 
according to the following formula: 

30 

DiskNumber(ExtentNumber)-ExtentNumber MODULO d, 

where NbOfExtentsInX and NbOfExtentsInY are the num- 
ber of extents required to cover respectively the width 
(x-axis) and the height (y-axis) of one plane of the consid- 
ered 3nd pixmap image. 35 

17. A multiprocessor parallel file system operable for 
storing and accessing in parallel data files striped onto 
multiple disks, said data files being striped onto the disks 
according to their dimensionality which ranges from 1 to 3, 
the multiprocessor parallel file system comprising: 40 

(a) a file server process operable for receiving file 
creation, opening and closing requests and operable for 
interaction with extent server processes regarding the 
creation and deletion of directory entries and distribu- 45 
tion information block table entries; 

(b) extent server processes running on disk node proces- 
sors operable for receiving file extent storage and 
access requests and for receiving from the file server 
process commands for the creation and deletion of 50 
directory entries and distribution information block 
table entries; and 

c) parallel file system interface library procedures for file 
creation, file opening, file access and file closing. 

18. A parallel file system of claim 17, where as a means 55 
for efficient parallel file access, the size of the extents striped 
onto the disk at storage time is computed by varying the 
extent width so as to make the offset between the respective 
disk numbers of an extent lying one extent row below 
another extent prime in respect to the number of disks. go 

19. A multiprocessor-multidisk storage server offering 
parallel file storage, access, and processing functions com- 
prising: 

(a) a server interface processor connected by a commu- 
nication means to at least one computer, said server 65 
interface processor being operable for receiving service 
requests from computers, further operable for 
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analyzing, processing and decomposing service 
requests into storage, access and processing commands 
and further operable for transmitting them to disk node 
processors; 

(b) disk nodes, said disk nodes each comprising a disk 
node processor and at least one disk, said disk node 
processor being operable for receiving commands, fur- 
ther operable for executing processing commands com- 
prising operations selected from the group of geometric 
transformations and image processing operations; 

(c) an interconnection network [:]between said server 
interface and disk nodes. 

20. A multiprocessor parallel file system operable for 
storing, accessing and operating in parallel on data files 
striped onto multiple disks, the multiprocessor parallel file 
system comprising: 

(a) a file server process operable for receiving file 
creation, opening and closing requests and operable for 
communicating with extent server processes regarding 
the creation and deletion of directory, entries and 
distribution information block table entries; 

(b) extent server processes running on disk node proces- 
sors operable for receiving file extent storage and 
access requests and for receiving from the file server 
process commands for the creation and deletion of 
directory entries and of distribution information block 
table entries; 

(c) operation slave processes running on disk node pro- 
cessors operable for receiving file reading commands 
and file operation commands, for executing them and 
for returning results; 

(c) an operation master process operable for receiving file 
access and file operating requests, operable for decom- 
posing said file access and file operating requests into 
file reading commands and into file operating 
commands, for transmitting said commands to opera- 
tion slave processes and for receiving the results from 
said operation slave processes; 

(d) a server interface process running on a server interface 
processor, the server interface process being operable 
for receiving service requests from client computers, 
further operable for analyzing, processing and decom- 
posing service requests into file storage, access and file 
operating commands and for transmitting them to the 
file server, extent server and operation master pro- 
cesses. 

21. A multiprocessor-multidisk storage sever offering 
parallel file storage, access, and processing functions com- 
prising: 

(a) a server interface comprising at least one local pro- 
cessor connected by a communication means to at least 
one computer; 

(b) disk nodes, said disk nodes each comprising a disk 
node processor and at least one disk, said disk being 
operable for storing data; 

(c) an interconnection network between said server inter- 
face and disk nodes whereby said server interface is 
operable for receiving service requests from computers, 
further operable for analyzing, processing and decom- 
posing service requests into storage, access and pro- 
cessing commands and further operable for transmit- 
ting them to said disk node processors which are 
operable for receiving commands, further operable for 
executing commands and further operable for returning 
results; and where service requests from remotely 
located client computers comprise: 
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(A) file storage and access requests, where the dime- 
sionality of the file is selected from [The] the group 
of 1 -dimension, 2-dimensions and 3 -dimensions and 
where extents are striped onto the disks at file storage 
time according to [The] the dimensionality of the 
file; 

(B) file transformation and processing requests; 
wherein: 

(i) file storage and access requests comprise file 
creating, opening, anti,deleting commands and file 
extent storage and access commands executed in 
parallel by disk node processors; 

(ii) file transformation and processing requests are 
decomposed into file part transformation anti pro- 
cessing operations executed in parallel by disk 
node processors. 

22. A multiprocessor multidisk storage server of claim 
[21] 21, where service requests further comprise: 

(a) redundancy file creation requests; 

(b) file recovery requests; wherein 

(i) redundancy file creation requests are executed by 
generating from data file extents parity file extents 
which are stored on different disks from the disks 
storing the corresponding data file extents; 

(ii) file recovery requests in the case of single disk 
crashes are executed by reading valid data extents 
from the file robe recovered and redundant parity 
extents from its corresponding redundancy file, by 
computing the lacking data extents and by writing 
the recovered file on valid disks. 

23. A multiprocessor multidisk storage server of claim 
[21] 21, where, as a means for efficient parallel file access, 
the size of the extents striped onto the disk at storage time 
is computed by varying the extent width so as to make the 
offset between the respective disk numbers of an extent lying 
one extent row below another extent prime in respect to the 
number of disks. 

24. A parallel fie storage, access and processing method 
using processes running on a server interface processor and 
processes running on disk node processing units, the method 
comprising the steps of: 

(a) receiving by the processes running on the server 
interface processor storage, access and processing 
requests from a client process; 

(b) decomposing file storage, access and processing 
requests into a plurality of access, storage and process- 
ing requests executable by processes running on disk 
node processors; 

(c) sending said plurality of access storage and processing 
requests to processes running on disk node[:] proces- 
sors; 

(d) receiving in parallel[:] said plurality of access, storage 
and processing requests by processes running on disk 
node processors; 

(e) executing in parallel said plurality of requests by 
accessing disk data on a plurality of disks and by 
applying processing operations executed by process as 
running on a plurality of processors; 

(f) assembling the results obtained by the parallel execu- 
tion of said requests into one result; 

(g) returning said result to the requesting client process, 
wherein 

(i) files are composed of extents, each extent compris- 
ing at least one elementary element, each elementary 
element being composed of at least one byte;[J 
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(ii) decomposing file storage, access and processing 
requests comprises the operations of segmenting 
files into extents; 

(iii) the processing operations executed by processes 
running on a plurality of processors comprise an 
operation selected from the group of geometric trans- 
formations and image processing operations. 

25. A parallel file storage, access and processing method 
of claim 24, where multidimensional files are segmented 
into multidimensional extents, where extents are incremen- 
tally numbered as is shown in the ease of the following 
3-dimensional extent positioned as the X th extent in the x 
direction, the J 1 * extent in y direction, and the extent in 
z direction according to the formula[:] 



15 



ExtentNumbcrCij^k^CNOfExtentsInY'NOfExtcntsInX)* 
j'NbOExtentsInX+i, 

wherein extents are striped into a set of available d disks 
according to the following formula 

DiskNumber(ExtentNumber)=ExtentNumber MODULO 
d, 

where NbOfExtentsInX and NbOfExtentsInY are the num- 
ber of extents required to cover respectively the width 
(x-axis) and the height (y-axis) of one plane of the consid- 
ered 3-d pixmap image. 

26. A multiprocessor parallel file system operable for 
storing, accessing and processing in parallel multidimen- 
sional data files striped onto multiple disks as multidimen- 
sional extents, said multidimensional data files being striped 
onto the disks according to their dimensionality which 
ranges from 1 to 3, the multiprocessor parallel file system 
comprising: 

(a) a file server process operable for receiving file 
creation, opening, closing and processing requests and 
operable for interaction with extent server processes 
regarding the creation and deletion of directory entries 
and distribution information block table entries; 

(b) extent server processes running on disk node proces- 
sors operable for receiving file extent storage, access 
and processing operations requests and for receiving 
from the file server processing commands for the 
creation and deletion of directory entries and distribu- 
tion information block table [entries] entries, wherein 
the extent server processes apply the processing opera- 
tion to the multidimensional extents., 

(c) parallel file system interface library procedures for file 
creation, file opening, file access and file [closing;] 
closing, wherein 

(i) [data files are composed of extents,] each multidi- 
mensional extent [comprising] comprises at least one 
elementary element, each elementary element [being 
composed of] comprising at least one [byte:] byte; 
and 

(ii) the processing operations executed by the extent 
server processes [running on a plurality of processors 
comprise an operation] are selected from the group 
of geometric transformations and image processing 

60 operations. 

27. A parallel file system of claim 24, where data files are 
multidimensional files segmented into multidimensional 
extents, where extents are incrementally numbered as is 
shown in the case of the following 3-dimensional extent 

65 positioned as the i th extent in the x direction, the extent 
in y direction, and the k** extent in z direction according to 
the formula 
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operating commands and for transmitting them to the 
file server, extent server and operation master pro- 

Exicn\NumbcT{[i^j,k)mK^(l^OfExtetusInY^bOJExtentsInX)+ cesses; wherein: 

fNbOfExtentsinX+i, q da * a files m com posed of extents, each extent 

wherein extents are striped [Jinto a set of available d disks 5 comprising at least one elementary element, each 

according to the following formula ty™^ composed of at least one 

(ii) decomposing file access requests comprises the 

DiskNumbcr(ExtciitNumb e r)-ExtentNumber MODULO d, operation of segmenting files into extents; and 

10 (iii) file operation commands comprise an operation 

where NbOfExtentsInX and NbOfExtentsInY are the num- selected from the group of geometric transforma- 

ber of extents required to cover respectively with width tions and image processing operations, 

(x-axis) and the height (y-axis) of one plane of the consid- 29. A parallel file storage and processing method offering 

ered 3-d pixmap image. high-bandwidth storage, access and processing of data files 

28. A multiprocessor parallel file system operable for is striped on a plurality of disks, comprising file creation, file 

storing, accessing and operating in parallel on data files reading, file writing, and file processing functions, whereby 

striped onto multiple disks, the multiprocessor parallel file storage, access and processing requests are sent by client 

system comprising: processes located outside the server, received by a server 

(a) a file server process operable for receiving file interface process located on the server interface processor, 
creation, opening and closing requests and operable for 20 analyzed and decomposed into requests directed to a file 
communicating with extent serve processes regarding se™er process, to extent server processes and to operation 
the creation and deletion of directory entries and dis- processes and where files are multidimensional files made of 
tribution information block table entries; elementary elements, each element being composed by one 

/L v A . . j * i j or more bytes, wherein multidimensional files are see- 

(b) extent server processes running on disk node proces- , J y ' ... . 4 t , . t t & 
' . ™„uio fx, _ • r - fitl etAro ^ op1 j 25 mented onto multidimensional extents, wherein extents are 
sors operable tor receiving file extent storage and „ - . r 

access requests and for receiving from the file server ™ rement f ? numbere ? " 1S , sho ?. n m . the , h ca f*° f , th * 

process commands for the creation and deletion of foHowmg 3 dimensional extent positioned as the h extent 

j. t , . , C j. , . . r . t , in the x direction, the J extent in y direction, and the 

directory entnes and of distribution information block . .. t J . 9 

. . , \ - extent in z direction accordmg to the formula 

table entnes; ° 

30 

(c) operation slave processes running on disk node pro- 
cessors operable for receiving file reading commands ExumtNumber(IijJy;A)Hfc*^^^ 
and file operation commands, for executing them and j*NbOExtcntslnX+i, 

for returning results, wherein extents are striped into a set of available d disks 

(d) an operation master process operable for receiving file 35 accor din g to the following formula 

access and file operating requests, operable for decom- D iskNumber(ExtentNumber)=ExtentNumber MODULO 

posing said file access and file operating requests into d 

reading commands and into operating commands, for where'NbOfExtentsInX and NbOjExtentsInY are the num- 

transmitting said commands to ^ operation . slave pro- bef of ired tQ ^ tivel ^ M 

cesses and for receiving the results from said operation 40 the hei ht ( is) of one lane of the consid . 

slave processes; and ^ ^ pixmap ^ 

(e) a server interface process running on a server interface 30. a parallel file storage and processing method in claim 
processor, the server interface process being operable 29, where storage, access and processing of data files striped 
for receiving service requests from client computers, 00 a plurality of disks. 

further operable for analyzing, processing and decom- 45 

posing service requests into file storage, access and file ***** 
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